Lexical Enrichment of Biomedical Ontologies
نویسندگان
چکیده
aBstRact This chapter is concerned with lexical enrichment of ontologies, that is how to enrich a given ontology with lexical information derived from a semantic lexicon such as WordNet or other lexical resources. The authors present an approach towards the integration of both types of resources, in particular for the human anatomy domain as represented by the Foundational Model of Anatomy and for the molecular biology domain as represented by an ontology of biochemical substances. The chapter describes our approach on enriching these biomedical ontologies with information derived from WordNet and Wikipedia by matching ontology class labels to entries in WordNet and Wikipedia. In the first case the authors acquire WordNet synonyms for the ontology class label, whereas in the second case they acquire multilingual translations as provided by Wikipedia. A particular point of emphasis here is on selecting the appropriate interpretation of ambiguous ontology class labels through sense disambiguation, which we address by use of a simple algorithm that selects the most likely sense for an ambiguous term by statistical significance of co-occurring words in a domain corpus. Acquired synonyms and translations are added to the ontology by use of the LingInfo model, which provides an ontology-based lexicon model for the annotation of ontology classes with (multilingual) terms and their linguistic properties.
منابع مشابه
Prioritising lexical patterns to increase axiomatisation in biomedical ontologies. The role of localisation and modularity.
INTRODUCTION This article is part of the Focus Theme of METHODS of Information in Medicine on "Managing Interoperability and Complexity in Health Systems". OBJECTIVES In previous work, we have defined methods for the extraction of lexical patterns from labels as an initial step towards semi-automatic ontology enrichment methods. Our previous findings revealed that many biomedical ontologies c...
متن کاملLexical Characterisation of Bio-ontologies by the Inspection of Regularities in Labels
Abstract: Hundreds of biomedical ontologies have been produced, with many of the significant, widely used ones being developed in collaborative efforts and following a set of construction principles, which include using a systematic naming convention for their labels. Despite their success, many of these ontologies have lacked a foundation of axioms that would expose the wealth of knowledge in ...
متن کاملThe OntoEnrich platform: using workflows for quality assurance and axiomatic enrichment of ontologies
Ontologies are rich in natural language content, because it facilitates the understanding of the ontology to humans. Biomedical ontologies contain more human-facing content than that which is machine-processable—not all the natural language content in definitions is mirrored as logical axioms, which is how machines can understand ontologies. Consequently, the development of methods and tools ab...
متن کاملOntoEnrich: A Platform for the Lexical Analysis of Ontologies
The content of the labels in ontologies is usually considered hidden semantics, because the domain knowledge of such labels is not available as logical axioms in the ontology. The use of systematic naming conventions as best practice for the design of the content of the labels generates labels with structural regularities, namely, lexical regularities. The structure and content of such regulari...
متن کاملProposed SKOS Extensions for BioPortal Terminology Services
The National Center for Biomedical Ontology (NCBO) BioPortal provides common access for browsing and querying a large set of ontologies that are commonly used in biomedical communities. One of our missions is to align lexical features (i.e., textual definitions) that are commonly used in these ontologies across different representation formats with standard tags and to represent them in a stand...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015